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Long-term follow-up information on children who have participated in early childhood special edu- 
cation (ECSE) has seldom been available. In the present study, the cognitive and academic perfor- 
mance of 171 thirteen-year-old graduates of 2 ECSE curricula is examined. Although preschool 
cognitive measures continued to predict later performance significantly, there were numerous exam- 
ples of “errors of prediction,” that is, children whose later academic performance was substantially bet- 
ter or worse than predicted on the basis of a regression equation. Six individual and family factors were 
individually at least moderately associated with outcome status and collectively strongly correlated 
with status as “improver” or “decreaser”: positive temperament, European American ethnicity, male 
gender, middle income, initial referral for gross motor concerns, and nonreferral for social concerns. 


How much does the past predict the future in human devel- 
opment? In particular, how powerful and enduring is the im- 
pact of early adverse experience, including poverty, illness, 
abuse or neglect, or poor early development? These questions 
can only be addressed with longitudinal studies, because in- 
dividuals differ not only in their level of performance at any 
given point in time but also in the shape of their developmental 
trajectory. Conventional standardized test scores and statistics 
generally assume a linear growth model, but nonlinear growth 
patterns in mental development are commonly observed in lon- 
gitudinal growth studies (e.g., McCall, Appelbaum, & Hog- 
arty, 1973). In this article, we address one aspect of nonlinear 
trajectory: children whose later performance is either better or 
worse than would have been predicted based on early perfor- 
mance. 

The great majority of research on long-term outcomes 
for children with early adverse experience has been conducted 
with typically developing children identified as being at risk 
for biological (e.g., prematurity) or environmental (e.g., pov- 
erty) reasons. Much less is known about long-term outcomes 
for children identified on the basis of poor performance in 
early childhood. Several sets of results suggest that focusing 
on the impact of child, family, and educational variables on 
these children would be highly illuminating. First, evidence 
from studies of compensatory education, for children grow- 
ing up in poverty, suggest that long-term effects can occur. 


and they may be different from short-term outcomes (Lazar 
& Darlington, 1982). Second, research on risk factors in de- 
velopment suggests that no one environmental or organismic 
factor has a strongly determining role in later outcome but the 
presence of multiple risk factors greatly increases the proba- 
bility of an adverse outcome. For example, Rutter et al. (1975) 
found that the probability of a psychiatric disorder in child- 
hood was strongly correlated with the number of the follow- 
ing risk factors present: family discord, parental criminal or 
psychiatric involvement, social disadvantage, and poor school 
environment. Third, there is evidence of substantial individ- 
ual differences in children’s vulnerability to risk factors, so 
much so that the term resilience is often invoked. Werner and 
Smith (1982) noted that even among their sample of toddlers 
with substantial risk due to poverty, low parental education, 
prematurity, or other factors, the rate of developmental diffi- 
culties was greatly reduced by the presence of a number of 
positive circumstances, such as availability of alternate care- 
givers, at least 2 years’ time between children, a workload for 
the mother that was not excessive, a cohesive family, and a 
multigenerational network of kin and friends during adoles- 
cence. Other research (see Masten & Coatsworth, 1998, for a 
review) points to the role of an easy temperament; an outgo- 
ing, social personality; and a sense of humor as positive fac- 
tors. Some of the factors on Werner and Smith’s list may well 
be facilitated by such characteristics in children. 
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Research on risk and protective factors in children at risk, 
together with the smaller body of research on children with 
demonstrated poor cognitive and linguistic skills, suggests 
that the nature of those factors varies with the developmental 
level of the child and the nature of the poor early performance 
(Masten & Coatsworth, 1998; Rutter, 1990; Werner, 1999). 
For example, in very early childhood, a close relationship with 
a caring parent or other caregiver is important, whereas in later 
childhood, a positive, outgoing personality and good relation- 
ships with peers play an increasingly important role. Later 
still, individual special talents — what Brooks (1999) called 
“islands of competence” — may have a substantial effect (Mas- 
ten & Coatsworth, 1998; see also Werner, 1999). Werner ob- 
served that the most important predictive factors for children 
with learning disabilities and attention-deficit/hyperactivity 
disorder differed by gender: For girls, low birthweight and 
family socioeconomic status (standard of education, maternal 
education) were the most important, whereas for boys, peri- 
natal stress and high activity level, along with family stabil- 
ity, were important at 1 year of age. In contrast, for children 
with mild mental retardation, the most important predictors, 
beyond the severity of the initial delay, were family stability 
and early caregiving environment during the first decade of 
life, but personality and motivational factors (being upbeat, 
prompt, and staying on task) became much more important 
after that (Werner, 1999). Thus, it would be premature to as- 
sume that the findings on protective factors based on children 
at risk, or at one particular developmental level, will general- 
ize to children at other levels or to those identified by poor 
early performance. 

The Longitudinal Comparison Project is a uniquely 
long-term follow-up study of a group of children who partici- 
pated in an early childhood special education program as pre- 
schoolers. They were randomly assigned to one of two 
different programs, Direct Instruction or Mediated Learning, 
for 1 or more years as preschoolers and have been followed 
with annual evaluations since that time. Most are now in their 
late teens. The overall design and results of this project are re- 
ported in Dale and Cole (1988); Cole, Dale, Mills, and Jenk- 
ins (1993); and Mills, Dale, Cole, and Jenkins (1995). Very few 
main effects of the program have been observed, but there has 
been a continuing series of significant aptitude-by-treatment 
interactions. Students who initially performed at a lower level 
have tended to do better if they were in the Mediated Learn- 
ing program, whereas students who initially performed at a 
higher level have tended to better if they were in the Direct In- 
struction program. 

The starting point for the present investigation was the 
continuing pattern of substantial correlations, from early cog- 
nitive measures to later cognitive and academic measures, de- 
spite differences in program and life histories. For example, 
the multiple correlation for the prediction of academic per- 
formance at age 13 from preschool entry scores for the Mc- 
Carthy Scales of Children’ s Abilities General Cognitive Index 


(McCarthy, 1972) and the Peabody Picture Vocabulary Test- 
Revised (PPTV-R; Dunn & Dunn, 1981) was .70. A correlation 
of this magnitude could be indicative of the intervention’s 
strong but constant effect. However, the fact that it holds 
across a sample of two groups that experienced quite differ- 
ent interventions and were composed of individuals who had 
enormously diverse lives after preschool strongly suggests a 
different and somewhat discouraging conclusion — that envi- 
ronment has had only a modest effect. However, a correlation 
of .70 is far from perfect. We believe that the high correlation 
in fact makes the study of individuals who diverge from the 
projected path all the more interesting. In this study, we iden- 
tified a group of students who diverged in either a positive or 
a negative direction and then looked at a number of possible 
organismic, family, and school influences on later academic 
achievement. We also explored the relationship between these 
divergences in academic development and cognitive develop- 
ment as assessed by intelligence tests and school placement 
experience. 

Method 

Only a summary of the intervention phase and follow-up phase 
of this project is provided here; more complete information is 
provided in Dale and Cole (1988), Cole et al. (1993), and Mills 
et al. (1995). 

Participants: Intervention Sample 

Over a 4-year period, 206 children were provided early inter- 
vention services in a laboratory school located in the United 
States. Students were eligible for special education services 
according to state guidelines. Upon entry in the program, their 
mean age was 4.9 years and their mean IQ was 76.7, as mea- 
sured by the McCarthy Scales of Children’s Abilities. The 
participants were 142 boys and 64 girls. The sample was 
composed of 120 European American children; 66 African 
American children; and 20 children who were Hispanic, Pa- 
cific Islander, Asian, Native American, or other. In the state 
in which the study was conducted, children qualify for special 
education if they exhibit a delay of at least 1.5 standard devi- 
ations on a normed measure in two or more of five areas (lan- 
guage, gross motor, fine motor, cognitive, or social-emotional 
development) or if they exhibit a delay of at least two stan- 
dard deviations below the mean in one of these areas. 

Classes and Assignment for Intervention 

Preschool participants (entering ages between 3 years and 
5 years 1 1 months) attended class 2 hours per day, 5 days per 
week, for 180 school days. There were six preschool classes 
per year, three for each of the two programs, with 12 students 
in each class. One of the three classes for each program con- 


THE JOURNAL OF SPECIAL EDUCATION VOL. 37/NO. 4/2004 239 


tained four typically developing students and eight children 
with disabilities. The other two classes for each program in- 
cluded only students with disabilities. Kindergarten children 
attended class 5.5 hours per day, 5 days per week, for 180 
school days. There was one such class for each program, with 
14 students per class. Students were randomly assigned to a 
program, and to a classroom within the program. 

During the intervention phase of this project, children 
participated in Direct Instruction (DI) or Mediated Learning 
(ML) programs. The DI program was based on the educa- 
tional approach of Englemann (1980). Instruction was explicit, 
teacher-directed, and fast-paced, with specific procedures for 
error correction, all directed toward very detailed academic 
learning outcomes. The specific curricula used for this program 
were Distar Language I and II (Englemann & Osborne, 1976), 
Distar Arithmetic I and II (Engelmann & Gamine, 1975), and 
Distar Reading I and II (Engelmann & Bruner, 1974). The ML 
program (Osborne & Sherwood, 1984), derived from the work 
of Eeuerstein (Eeuerstein, Rand, Hoffman, & Miller, 1980), 
emphasized cognitive rather than academic learning. Children 
were taught a variety of thinking strategies, including plan- 
ning ahead, distinguishing the relevance of information, gen- 
erating multiple solutions, and evaluating their performance. 
To promote generalization, teachers followed the child’s lead 
rather than presenting prescribed materials, elicted responses 
rather than modeling for imitation, and dispersed instruction 
throughout the day rather than confining topics to time blocks. 
Further information about these programs and about fidelity 
of implementation is provided in Cole et al. (1993) and Notari- 
Syverson, Cole, Osborne, and Sherwood (1996). 

Intervention Phase Measures 

Children participated in the intervention phase for 1 or more 
years. In the present report, the following subset of measures 
obtained during each child’s 1st year were used. Pretests were 
administered from October through December, and posttests 
from May through August; the average time between tests was 
approximately 8 months. 

McCarthy Scales of Children’s Abilities. The McCarthy 
scales yield a general measure of intellectual functioning (the 
General Cognitive Index [GCI]), as well as more specific mea- 
sures of verbal, perceptual performance, short-term memory, 
quantitative abilities, and motor coordination. It covers the 
developmental range from 2'A to EA years. The test is well 
standardized and psychometrically sound, with split-half and 
test-retest reliabilities approximating .9 for the GCI (Sattler, 
1988). 

PPVT-R. The PPVT-R was utilized as a major measure 
of language development, because of its excellent standard- 
ization, reliability (split-half and test-retest coefficients of 
approximately .7-.8 in the preschool age range), and validity 


(Sattler, 1988). Although the test measures only receptive vo- 
cabulary, using a picture-pointing format, it is substantially 
correlated with a variety of other language measures. Further, 
some form of vocabulary test is included in nearly all intelli- 
gence tests, permitting an evaluation of the prediction based 
on early vocabulary skills. 

Participants: Follow-Up Sample 

After the students graduated from the intervention phase of 
the study, we followed up with them at 1-year intervals and 
gave them a battery of tests that varied with the age of the stu- 
dent. For the present study, we examined the students’ test 
scores from age 13, approximately 8 years after the children 
entered the intervention. This age was selected because it was 
the age at which the largest number of students had been 
tested. A total of 171 students received the battery at age 13. 
This group was composed of 86 students in the ML curricu- 
lum group and 85 in the DI curriculum group. Of the re- 
maining children, some had not yet reached 13, a few reached 
the age of 13 before funding was available for follow-up test- 
ing, or their families could not be reached or did not agree to 
participate. 

Because children left the intervention at different ages, 
the number of years that had passed since beginning the inter- 
vention varied for students at a given age during the follow- 
up period; however, the two groups were comparable on this 
measure. Table 1 includes these and other descriptive measures 
for the sample. A series of t tests confirmed no significant dif- 
ferences on any of these measures. 

Follow-Up Measures 

Peabody Individual Achievement Test (FIAT). The 
PIAT (Dunn & Markwardt, 1970) was selected as the primary 
academic measure because of its good standardization, design 
as an individually administered achievement test, wide range 
of subtests and age of applicability, and suitability to measure 
changes in general cognitive processes that DI is designed 
to facilitate (Sattler, 1988). Median test-retest reliabilities of 
.89 for the total score have been reported (Sattler, 1988), along 
with substantial correlations with other achievement tests. Al- 
though a revision of the PIAT is now available, the annual mea- 
sures of academic achievement used in this study mandated 
continuation of use of the original PIAT. 

Stanford-Binet Intelligence Scale-Fourth Edition 
(SBIS-rV). The SBIS-IV (Thorndike, Hagen, & Sattler, 1986) 
was selected as the primary cognitive measure because of its 
excellent standardization, good reliability and validity, wide 
range of subtests, applicability over a broad age range, and suit- 
ability to measure changes in general cognitive processes that 
ML is designed to facilitate (Sattler, 1988). Both internal con- 
sistency and test-retest reliability coefficients are above .9. 
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TABLE 1. Composition of Follow-Up Sample at Age 13 by Program 



Direct instruction^ 

Mediated learning^’ 

Overalb 

Description 

n M SD 

n M SD 

n M SD 


Sex 


Boys 

59 



58 



117 



Girls 

26 



28 



54 



Ethnicity 










European American 

42 



56 



98 



African American 

36 



21 



57 



Other 

7 



9 



16 



Age at pretest 


4.82 

.82 


4.75 

.91 


4.79 

.86 

Entering McCarthy GCI 


78.00 

17.40 


76.70 

16.20 


77.40 

16.80 

Years in intei'vention 


1.69 

.72 


1.72 

.81 


1.71 

.76 


Note. McCarthy GCI = McCarthy Scales of Children’s Abilities General Cognitive Index Scores (McCarthy, 1972). 
= 85. 1’n = 86. S = 171. 


The SBIS-IV was administered approximately every 3 years; 
the results from age 12 were used in the present study. 

Identification of Participants with 
“Errors of Prediction” 

As shown in Table 2, both entering McCarthy GCI scores and 
PPVT-R standard scores predicted age 13 and earlier PIAT 
total scores, as well as age 12 SBIS-IV scores. A multiple- 
regression analysis confirmed that both scores contributed 
unique variance to the prediction. The multiple correlation for 
the combined prediction was .70, representing 49% of the 
total variance. 

Next, a single multiple-regression analysis was per- 
formed, using both preschool-phase measures (McCarthy GCI 
and PPVT-R standard score) to predict the PIAT total score for 
the entire sample for whom all relevant measures were avail- 
able. As part of the analysis, standardized residuals — the dif- 
ference between the predicted and actual PIAT total scores, 
(standardized toM-Q, SD = 1 for the sample) — were computed. 
Positive residuals indicated relatively better performance than 
predicted by the overall regression formula, whereas negative 
residuals indicated relatively poorer performance than pre- 
dicted. Standardized residuals of +/- 1 .5 were interpreted as 
errors of prediction. All participants (« = 14) for whom the 
standardized residual exceeded 1.5 were identified as im- 
provers, whereas those whose standardized residual fell be- 
low -1.5 (n = 9) were identified as decreasers. A third set of 
23 participants was identified as a comparison group. Each 
participant in the improver and decreaser groups was matched 
to a comparison participant on the basis of GCI scores (within 
2 points) from the end of the 1st year and an error of predic- 
tion of less than 1 standard deviation. Within these constraints, 
participants were matched as closely as possible for entering 
GCI. As shown in Table 3, the three groups were quite com- 


parable with respect to preschool-phase GCI scores but di- 
verged substantially in age 13 PIAT total scores. Follow-up 
Tukey HSD tests revealed that the improver group scored sig- 
nificantly higher than the comparison group, which in turn 
scored significantly higher than the decreaser group. The 
groups did not diverge on SBIS-IV scores, however. 

Child, Program, and Family Measures 

A number of measures were evaluated as predictors of outcome 
category: gender, ethnicity, preschool program (Direct Instruc- 
tion vs. Mediated Learning), age at entry to preschool program, 
number of years in the program, income (based on application 
for free or reduced-cost lunch during the preschool period), 
and initial referral categories for special education (six nonex- 
clusive categories: language, cognition, gross motor develop- 
ment, fine motor development, social behavior, and medical 
concerns). In addition to these measures that had been previ- 
ously collected, three additional measures — temperament, fam- 
ily stability, and parental advocacy — were obtained. These 
measures were identified on the basis of previous research on 
resilience, particularly from Werner and Smith (1982), and 
conversations with project staff members about what they re- 
membered about participants across this considerable time 
span. To minimize the possibility of bias from staff members’ 
memories of actual test performance, the following two-part 
procedure was used. Two long-term members of the research 
project staff with substantial experience in testing and inter- 
viewing the children and interviewing the parents were given 
the list of 46 names of participants in the improver, decreaser, 
and comparison groups but were not told their group status. 
They were asked to write as much information about the child 
and his or her family as possible. Issues of temperament, fam- 
ily stability, and parental advocacy were specifically men- 
tioned, but the staff members were encouraged to provide any 
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TABLE 2. Prediction (Pearson Correlations) of Ages 10 Through 
13 PIAT Total Scores and Age 12 SBIS-IV 




PIAT total score^ 


SBis-rv 

Predictor 

10 yrs 

11 yrs 

12 yrs 

13 yrs 

12 yrs 

McCarthy GCI 

.56* 

.56* 

.54* 

.57* 

.60* 

PPVT-R standard score 

.42* 

.45* 

.36* 

.39* 

.42* 


Note. PIAT = Peabody Individual Achievement Test (Dunn & Markwardt, 1970); SBIS-IV = Stanford-Binet Intelligence 
Scales-Fourth Edition (Thomdyke, Hagen, & Saltier, 1986); McCarthy GCI = McCarthy Scales of Children’ s Abilities General 
Cognitive Index scores (McCarthy, 1972); PPVT-R = Peabody Picture Vocabulary Test-Revised (Dunn & Dunn, 1981). 

^/7s for these correlations range from 134 to 171. 

*/ 7 < . 001 . 


TABLE 3. Comparison of Outcome Groups on Intervention Phase McCarthy GCI, Age 13 PIAT Total Scores, 
and Age 12 Stanford-Binet Test Composite 




Group 




Improver 

Comparison 

Decreaser 


Measure 

M SD 

M SD 

M SD 

ANOVA 


McCarthy GCP (entering) 

80.2 

14.2 

80.7 

12.9 

78.6 

10.7 

F(2, 43) = .09, ns 

Age 13 PIAT totaP 

108.1 

7.0 

84.8 

10.9 

68.7 

4.7 

F(2, 43) = 57.6, p < .001 

Age 12 SBIS-IV compositea’t> 

74.6 

18.1 

76.4 

18.1 

82.1 

8.5 

F(2, 35) =A4,ns 


Note. McCarthy GCI = McCarthy Scales of Children s Abilities General Cognitive Index scores (McCarthy, 1972); PIAT total = Peabody Individual Achievement Test 
total score (Dunn & Markwardt, 1970); SBIS-IV composite = Stanford-Binet Intelligence Scale-Fourth Edition composite score (Thorndike, Hagen, & Saltier, 1986). 
^All scores are standard scores, M = 100, SD = 15. ^SBIS-IV scores were not available for all participants. 


other information about the child and family they could re- 
member, in case other factors emerged as relevant (though 
none with enough frequency to be analyzed did emerge). The 
narratives were then independently coded by individuals with 
training in child development who were unfamiliar with either 
the families or the original testers. After some preliminary in- 
vestigations, the coding scales proved reasonably reliable. (The 
coding system is detailed in the Appendix.) 

School Placement 

Approximately 75% of the students in this project attended a 
single urban school district for at least part of their education. 
With the consent of parents and/or adult students, we obtained 
special education history information from the school district. 
Because of moves in and out of district and because the rele- 
vant records stretched back to the late 1980s, with several 
classification system changes having occurred since then, the 
information was less complete than we had hoped for. We had 
hoped to be able to identify students who had been placed in 
general education, minimal special education (consultation or 
research room), and self-contained special education. How- 


ever, due to ambiguities in the records, we were limited to dif- 
ferentiating between general and special education placement 
for each academic year. 

Results 

Longitudinal Trends in Mean 
PIAT Performance 

For the present report, PIAT total scores at age 13 were used to 
identify groups. It is of interest to examine longitudinal trends 
in PIAT scores to determine if the scores at age 13 adequately 
reflect overall performance and to examine the nature of de- 
velopmental change. Figure 1 includes the preschool-phase 
measures, as well as PIAT total scores at ages 8 through 13 
for the improver, decreaser, and comparison groups. As the 
figure shows, the scores at age 13 reflect stable developmen- 
tal trends. (Note that the scores appear relatively high in ab- 
solute terms, reflecting the use of old norms for the original 
PIAT; a comparison of the two tests for seventh graders [Dunn 
& Dunn, 1981] revealed a 6.5-point mean difference between 
the two tests.) 


242 THE JOURNAL OF SPECIAL EDUCATION VOL. 37/NO. 4/2004 



.4 Improvers 

. . Comparison 
4 Decreasers 


Measure 


FIGURE 1. Longitudinal trends in mean PIAT performance for three outcome 
groups. Note. McCarthy GCI = McCarthy Scales of Children’s Abilities General 
Cognitive Index scores (McCarthy, 1972); PIAT = Peabody Individual Achievement Test 
(Dunn & Markwardt, 1970). 


Comparison of Prediction Groups 

Child Variables. A comparison of the three groups on 
gender, ethnicity, and preschool program variables is provided 
in Table 4. Although there was a trend for males to have more 
positive outcomes, this was nonsignificant. In contrast, ethnic- 
ity and temperament were significantly related to outcome. A 
higher proportion of European American participants were in 
the improver group, and the converse was true for the African 
American and “other” groups. With respect to temperament, 
children with a relatively positive temperament (score =1) were 
more likely to be in the improver group, whereas those viewed 
as difficult, shy, or unremarkable (score = 2 or 5) were more 
likely to be in the decreaser group. 

Table 5 presents a comparison of the groups on the basis 
of initial referral category. Although there are a number of in- 
teresting trends, particularly the findings concerning children 
referred for social reasons, the only significant difference is 
that children referred for gross motor concerns were more 
likely to be in the improver group. 

Preschool Program Variables . A comparison of the three 
prediction groups on preschool program variables is also in- 
cluded in Table 4. There was no relation between outcome and 
preschool program (DI or ML), age of entry, or the number of 
years the child was in the preschool program. 

Family Measures. Table 6 summarizes comparisons of 
the three groups on income, family stability, and parental ad- 
vocacy. There is a nonsignificant trend for lower income, as 
indexed by application for free or reduced-cost lunch, to be 


associated with outcome as a decreaser. Neither family stabil- 
ity nor parental advocacy appeared to be related to outcome. 

Risk/Protective Factors Model. On the basis of the 
above analyses, three variables appeared to be significantly 
related to outcome group: temperament, ethnicity, and referral 
for gross motor concerns. For three others there was a sug- 
gestive nonsignificant trend: sex, income, and (absence of) re- 
ferral for social concerns. Because a case can be made for the 
importance of these variables on the basis of theory and pre- 
vious research, we decided to include them in the model. A 
summary score of “protective factors” was computed by 
adding one point for each of the following traits: generally 
positive temperament, male, European American ethnicity, 
middle income, referral for gross motor concerns, and absence 
of referral for social concerns. The summary score is signifi- 
cantly associated (Spearman r = .50, p < .001) with outcome 
group treated as an ordered, three-valued variable (where 1 = 
decreaser and 3 = improver). Figure 2 illustrates the proba- 
bility of placement in the three outcome groups as a function 
of the summary score. 

School Placement 

At ages 8 through 13, school placement information was 
available for between 24 and 28 of the students. Figure 3 il- 
lustrates the proportion of students (for whom information 
was available) in the three outcome groups who had special 
education placement at each age. Students in the improver 
group were consistently less likely to be in special education 
than were students in the decreaser and comparison groups. 


243 


TABLE 4. Comparison of Outcome Groups on Child and Preschool Program Variables 


Measure 




Group 





pa 

Improver 

Comparison 


Decreaser 

n M 

SD 

n 

M 

SD 

n 

M 

SD 

Sex 









ns 

Boys 

12 


16 



8 




Girls 

2 


7 



3 




Ethnicity 









< .05 

European American 

10 


14 



2 




African American 

4 


7 



3 




Other 

0 


2 



4 




Temperament category 









< .05 

1 

7 


11 



0 




2 

5 


6 



6 




4 

0 


0 



0 




5 

0 


5 



2 




6 

2 


0 



1 




Program 









ns 

DI 

5 


10 



4 




ML 

9 


13 



5 




Age at entry 

4.93 

1.29 


5.09 

.92 


4.76 

.74 

ns 

Yrs. in program 

1.93 

.73 


1.65 

.83 


1.89 

.78 

ns 


Note. DI = direct instruction; ML = mediated learning. Temperament category; 1 = positive, 2 = not remarkable, 4 = hostile or aggressive, 5 = otherwise diffi- 
cult, 6 = diagnosed disorder or syndrome (see Appendix). 

^Chi-square test for categorical data; one-way analysis of variance for interval data. 


TABLE 5. Initial Referral Categories (Nonexclusive) as Predictors of Outcome Category 







Group 







Improver 

Comparison 


Decreaser 


Referral category 

n 

n 

(%) 

n 

(%) 

n 

(%) 

pa 

Language 

Referred 

34 

12 

(35) 

15 

(44) 

1 

(21) 

ns 

Nonreferred 

12 

2 

(17) 

8 

(67) 

2 

(17) 


Cognition 

Referred 

11 

4 

(36) 

4 

(36) 

3 

(27) 

ns 

Nonreferred 

35 

10 

(29) 

19 

(54) 

6 

(17) 


Social 

Referred 

25 

7 

(28) 

11 

(44) 

7 

(28) 

l\2) = 2.49 

Nonreferred 

21 

7 

(33) 

12 

(57) 

2 

(10) 

p < .15 

Gross motor 

Referred 

27 

11 

(41) 

14 

(52) 

2 

(7) 

1 H 2 .) = 7.27 

Nonreferred 

19 

3 

(16) 

9 

(47) 

7 

(37) 

p < .05 

Line motor 

Referred 

27 

11 

(41) 

12 

(44) 

4 

(15) 

lH2) = 3.44 

Nonreferred 

19 

3 

(16) 

11 

(58) 

5 

(26) 

p < .09 

Medical 

Referred 

8 

4 

(50) 

3 

(25) 

1 

(9) 

ns 

Nonreferred 

38 

10 

(26) 

20 

(53) 

8 

(21) 



^Chi-square test of significance of association between each referral category (referral vs. nonreferral) and placement in improver, comparison, or decreaser group. 
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TABLE 6. Comparison of Outcome Groups on Family Variables 


Group 

Improver Comparison Decreaser 


Measure 

M 

SD 

M 

SD 

M 

SD 

P 

Income^ 

2.07 

(1.00) 

2.48 

(.93) 

1.56 

(.88) 

F(2, 41) = 3.1,/’<.06 

Family stability*’ 

1.64 

(.74) 

1.77 

(.69) 

2.00 

(.87) 

F(2, 42) = .64, ns 

Parental advocacy*’ 

1.64 

(.50) 

1.73 

(.46) 

1.78 

(.44) 

F(2, 42) = .26, ns 


^Three-point scale, with higher values indicating more favorable values. ^Three-point scale, with lower values indicating more favorable values. 


which did not appear to differ substantially between them- 
selves. A composite measure of special education experience 
was then computed as the proportion of years for which in- 
formation was available in which the student had been placed 
in special education. This measure could be computed for 30 
of the 46 students (because missing information at any one 
age did not remove the student overall) and had values of 0 
through 1.0, with a median value of .50. The three groups dif- 
fered in the expected direction: for improvers, M — .30 (SD = 
.41); for comparison, M = .62 {SD — .43); and for decreasers, 
M — .73 {SD = .30). A univariate analysis of variance yielded 
a nearly significant effect for group, F{2, 27) = 2.10, p < .09. 
To attempt to control for individual differences in early abil- 
ity measures and to provide a more rigorous test of the pre- 
dictive value of special education placement beyond the 
positive factors identified above, a multiple regression was 
conducted, with FIAT total scale score at age 13 as the de- 
pendent variable. The student’s McCarthy GCI score on en- 
tering the preschool program and his or her McCarthy GCI 
score at the end of the first year of preschool were both en- 
tered as the first block; the composite “positive factors” score 
defined above, which captured early child and family charac- 
teristics, was entered second; and the composite measure of 
special education experience was entered as the third and final 
block. The special education measure produced an change 
of .204, T/l, 25) = 7.90, p < .Ol.Thus, even taking into ac- 
count early measures of ability and response to preschool pro- 
gram, along with selected child and family characteristics 
already demonstrated to be associated with outcome, special 
education is significantly and negatively related to academic 
achievement at age 13 (standardized beta coefficient = -.473). 

Discussion 

Identification of Outcome Groups 

The first noteworthy issue is the meaningfulness of categoriz- 
ing participants as improvers or decreasers. The criterion used 
for this purpose, 1.5 standard deviations for the residual of pre- 
diction, is based entirely on variance within the present sam- 


ple. It therefore cannot be interpreted as having any absolute 
significance. At best, we can say that these participants were 
further from the predicted scores than the remainder of the 
sample. However, the finding that this classification is reli- 
ably associated with specific child and family characteristics 
does provide a kind of convergent validity. Furthermore, the 
high stability of FIAT scores from year to year (see Table 3) 
suggests that the classification has substantial test-retest reli- 
ability. 

Strikingly, the three groups did not differ in their age 12 
SBIS-IV composite scores. This difference in outcome clas- 
sification for the academic and cognitive measures cannot be 
attributed to differential reliability; FIAT and SBIS-IV scores 
are predicted about equally well by the preschool tests (see 
Table 2). Instead, the difference between the outcome groups 
is genuinely specific to academic achievement. Thus, the error 
of prediction is not just from early test scores to later acade- 
mic achievement; it also occurs between concurrent cognitive 
and academic measures. The divergence between the two sug- 
gests that there are powerful aspects of experience that affect 
academic achievement relatively independent of IQ scores. 

Improvers, or Less Valid Early Testing? 

The analyses and interpretation offered in the previous section 
are based on the assumption that errors of prediction are pri- 
marily a reflection of changes in achievement of the partici- 
pants over time. An alternative interpretation is that improvers’ 
early test scores underestimated their cognitive functioning, 
which provided a greater scope for improvement. Conversely, 
decreasers’ early test scores may have overestimated their 
functioning. 

This alternative interpretation is unlikely to be respon- 
sible for the main body of results. Improvers were more likely 
to be middle-class European American children of positive 
temperament who were not initially referred for social rea- 
sons. These four factors should have made them easier for the 
(middle class, European American) testers to assess accu- 
rately. Improvers were also more likely to be boys and to have 
been referred for gross motor concerns; these two factors 
could have affected testing accuracy, but they were likely to 
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-• — improvers 
Comparison 
— Decreasers 


# of positive factors 


FIGURE 2. Number of positive factors and outcome status. 



-• — Improvers 
■ - Comparison 

— Decreasers 


Age 


FIGURE 3. School placement experience for three outcome groups. 


be outweighed by the four previous factors. We conclude that 
our classification to some substantial extent reflected devel- 
opmental trajectory. 

Factors Influencing 
Developmental Trajectory 

The present study was originally designed as a program com- 
parison focused on cognitive and academic measures, not as 
a long-term developmental study. In consequence, we have 
only a relatively impoverished set of child and family vari- 
ables; this is especially true for social and personality factors. 


A measure of maternal education would be better than quali- 
fication for free/reduced-cost lunch as a measure of socio- 
economic status, for example. For this reason, these results 
should be viewed as an underestimate, a lower bound, for the 
number and role of relevant child and family characteristics. 
Another limitation to the conclusions is that the first assess- 
ment of the children was at approximately age 4. By this time, 
some of these factors may have already influenced develop- 
ment, reducing the potential error of prediction, that is, again 
leading to an underestimate of effects. 

Despite these limitations, the findings are intriguing in 
several respects. First, there are some identifiable factors re- 
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lated to outcome status, and they are in general plausible ones. 
Some of the specific findings might have been expected, even 
if they are discouraging reminders of the impact of sociocul- 
tural factors (e.g., the role of income/social class and Euro- 
pean American ethnicity). The findings for temperament are 
notably strong and consistent with other research on resili- 
ence: None of the decreasers were viewed as likeable or out- 
going, but half of the increasers were. Initial nonreferral for 
social concerns likely functions in the same way as tempera- 
ment. We did not anticipate the relative advantage for male 
gender, however; in fact, this finding was in opposition to the 
results for gender found in the studies reviewed by Rutter 
(1990), all of which were conducted with at-risk children. It is 
frequently noted that the disproportionate representation of 
boys in samples with language and learning disabilities is much 
greater in clinic-referred samples than in community samples. 
A common explanation for this is that girls who are referred 
for special education services have to have more severe, or 
broader, disabilities in order to be referred. To the extent that 
this is correct, early measures for girls may underestimate the 
degree of their problem and thus fail to signal the lower prob- 
ability of improving later. Further, given the results of Rutter 
et al. (1975) and Werner and Smith (1982), it is surprising that 
neither family stability nor parental advocacy appeared to play 
a role in outcome. It may be that because of this was con- 
ducted in an urban, relatively well-educated setting where 
there was a network of support agencies and institutions, all 
the families had sufficient social support to avoid negative ef- 
fects. Taken together, these findings confirm the expectation 
that although there are some common factors such as positive 
temperament, different constellations of protective factors 
will be found for children with early developmental impair- 
ments. The present sample is being followed until age 18, and 
it is possible that the nature of the predictors will change by 
that time. 

Second, no one factor is predominant; instead, it appears 
to be the conjunction of several factors that make it possible 
to “beat the odds,” as Figure 2 illustrates. All but one of the 
improvers had three or more positive factors, and all but one 
of the decreasers had two or fewer. No one factor comes even 
nearly close to discriminating these two groups as well as the 
whole set does. It is most likely that what all these factors have 
in common is that they influence treatment by parents, teach- 
ers, peers, and others over the years. The combined role of 
nonreferral for social concerns and positive temperament is 
especially interesting in the light of the recent longitudinal 
findings of Caprara, Barbaranelli, Pastorelli, Bandura, and 
Zimbardo (2000). On the basis of structural equations mod- 
eling, they concluded that the strongest third-grade predictor 
of both academic achievement and peer social acceptance in 
the eighth grade was prosocial behavior, based on a combi- 
nation of self-, teacher, and peer ratings. The strength of this 
prediction held up even when third-grade academic achieve- 
ment was controlled statistically. Caprara et al. concluded that 


a prosocial orientation fosters mutually supportive social and 
intellectual relationships with peers and reduces vulnerability 
to depression and aggression, both of which can undermine 
academic motivation. 

The fact that the divergence among the three outcome 
groups was specific to academic achievement highlights the 
specific role of educational experience. This discrepancy be- 
tween cognitive and academic outcomes has been noted in other 
situations, despite the fact that intelligence tests were origi- 
nally developed with the primary goal of predicting academic 
achievement. For example. Hart and Risley (1995) assessed 
29 of the 42 typically developing children who had been in 
their longitudinal study as toddlers when they were 9 to 10 
years of age. They found that age 3 SBIS-IV scores were sub- 
stantially correlated with later standardized tests of language 
development but were not correlated with academic achieve- 
ment in reading, writing, spelling, and arithmetic. We suggest 
that part of the explanation will center on the experiences 
children have at school and the way those experiences are af- 
fected by characteristics of the children. 

The most problematic of our results concerns school 
placement experience. Lower academic achievement, and clas- 
sification as a decreaser, was associated with greater experience 
in special education than was classification as an improver, 
even when controlling for preschool period and family mea- 
sures. Special education placement is potentially both an out- 
come measure and a causal factor in academic achievement, 
and for this reason the interpretation of the association is ex- 
tremely difficult. The present results are entirely consistent 
with the hypothesis that poor academic achievement in the 
early years, beyond that predictable by preschool period mea- 
sures, and especially when accompanied by difficult tempera- 
ment and behaviors, leads to special education placement as 
well as to poor academic achievement at age 13. They are 
equally consistent with the hypothesis that extended time in 
special education limits the growth of academic achievement. 
It is again notable that the groups did not differ in their age 
12 Stanford-Binet scores. This set of results suggests that both 
student motivation and teacher expectation merit further ex- 
amination. 

Conclusions 

As acknowledged earlier, the present study is inherently retro- 
spective. There is a crucial need for long-term, prospective 
outcome studies of children who have been identified as hav- 
ing special needs in early childhood. These studies should set 
academic and cognitive development in a broad social, famil- 
ial, and cultural perspective, in order to identify more compre- 
hensively and more accurately the environmental factors that 
can have a powerful effect on the path of development. We 
believe that the present results do provide strong, if indirect, 
evidence that experience can make a powerful difference. How- 
ever, characteristics of the child and family may have quite 
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unintentional, deleterious effects on the child’s environment. 
The goal of education and other interventions must be to en- 
sure that every child receives the best possible experience, to 
take the upwardly divergent path. Brooks has highlighted the 
role of “islands of competence” in social, mechanical, ath- 
letic, and other domains in building self-esteem, hope, and 
self-discipline, with a number of practical suggestions. Rut- 
ter (1990), drawing on theories of development as well as em- 
pirical research, has offered a particularly valuable summary: 

The limited evidence available so far suggests that 
protective processes include (a) those that reduce the 
risk impact by virtue of effects on the riskiness it- 
self or through alteration of exposure to or involve- 
ment in the risk, (b) those that reduce the likelihood 
of negative chain reactions stemming from the risk 
encounter, (c) those that promote self-esteem and 
self-efficacy through the availability of secure and 
supportive personal relationships or success in task 
accomplishment, and (d) those that open up oppor- 
tunities. . . . Particular attention needs to be paid to 
the mechanisms operating at key turning points in 
people’s lives, when a risk trajectory may be redi- 
rected onto a more adaptive path. (pp. 209-210) 


AUTHORS’ NOTE 

The Longitudinal Comparison Project is an ongoing study conducted 
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Appendix 


The narratives produced by the two primary research project 
staff members for each participant were coded independently 
by other individuals, who were unaware of the outcome group 
to which the participant had been assigned. Three aspects of 
child and family characteristics were coded. 

Temperament 

1 = outgoing, likeable, relates well, easygoing, or 

other positive 

2 = not remarkable, no information, or shy/withdrawn 

3 = this code was not used 

4 = hostile or aggressive 

5 = not hostile or aggressive but otherwise difficult, 

hyperactive, or impulsive 

6 = diagnosed mental disorder or congenital 

syndrome 

Agreement = 73%, Cohen’s kappa = .62 


Family Stability 

1 = high stable family, could be single parent or 

early adoption, if stable afterward 

2 = low/moderate changes, such as single divorce/ 

remarriage, or little or no information available 

3 = several moves or partner changes, includes foster 

care placement, running away from home 

Agreement = 73%, Cohen’s kappa = .59 

Parental Advocacy 

1 = unusually strong, such as getting the child a 

special tutor/counselor or home schooling 

2 = not remarkable or no information 

3 = appears low 

Agreement = 89%, Cohen’s kappa = .76 


